Uniform Glivenko-Cantelli Theorems and Concentration of Measure in the Mathematical Modelling of Learning

نویسنده

  • Martin Anthony
چکیده

This paper surveys certain developments in the use of probabilistic techniques for the modelling of generalization in machine learning. Building on ‘uniform convergence’ results in probability theory, a number of approaches to the problem of quantifying generalization have been developed in recent years. Initially these models addressed binary classification, and as such were applicable, for example, to binary-output neural networks. More recently, analysis has been extended to apply to regression problems, and to classification problems in which the classification is achieved by using real-valued functions (in which the concept of a large margin has proven useful). In order to obtain more useful and realistic bounds, and to analyse model selection, another development has been the derivation of datadependent bounds. Here, we discuss some of the main probabilistic techniques and key results, particularly the use (and derivation of) uniform Glivenko-Cantelli theorems, and the use of concentration of measure results. Many details are omitted, the aim being to give a high-level overview of the types of approaches taken and methods used.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Preservation Theorems for Glivenko-cantelli and Uniform Glivenko-cantelli Classes Aad Van Der Vaart and Jon

We show that the P Glivenko property of classes of functions F1; : : : ;Fk is preserved by a continuous function ' from R k to R in the sense that the new class of functions x! '(f1(x); : : : ; fk(x)); fi 2 Fi; i = 1; : : : ; k is again a Glivenko-Cantelli class of functions if it has an integrable envelope. We also prove an analogous result for preservation of the uniform Glivenko-Cantelli pro...

متن کامل

Uniform Glivenko–Cantelli Classes

A class of sets, or functions, is said to be P–Glivenko–Cantelli if the empirical measure Pn converges in some sense to the true measure, P , as n → ∞, uniformly over the class of sets or functions. Thus, the notions of Glivenko–Cantelli, and likewise uniform Glivenko–Cantelli are for the most part qualitative assessments of how “well–behaved” a collection of sets or functions is, in the sense ...

متن کامل

Preservation Theorems for Glivenko-Cantelli and Uniform Glivenko-Cantelli Classes

We show that the P−Glivenko property of classes of functions F1, . . . ,Fk is preserved by a continuous function φ from R to R in the sense that the new class of functions x → φ(f1(x), . . . , fk(x)), fi ∈ Fi, i = 1, . . . , k is again a Glivenko-Cantelli class of functions if it has an integrable envelope. We also prove an analogous result for preservation of the uniform Glivenko-Cantelli prop...

متن کامل

A Counterexample Concerning the Extension of Uniform Strong Laws to Ergodic Processes

We present a construction showing that a class of sets C that is Glivenko-Cantelli for an i.i.d. process need not be Glivenko-Cantelli for every stationary ergodic process with the same one dimensional marginal distribution. This result provides a counterpoint to recent work extending uniform strong laws to ergodic processes, and a recent characterization of universal Glivenko Cantelli classes.

متن کامل

Some Converse Limit Theorems for Exchangeable Bootstraps

The bootstrap Glivenko-Cantelli and bootstrap Donsker theorems of Giné and Zinn (1990) contain both necessary and sufficient conditions for the asymptotic validity of Efron’s nonparametric bootstrap. In the more general case of exchangeably weighted bootstraps, Praestgaard and Wellner (1993) and Van der Vaart and Wellner (1996) give analogues of the sufficiency half of the Theorems of Giné and ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002